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ABSTRACT 

Efforts to ittplement affirmative action programs will 
not be effe::tive until alternatives to the traditicral use -of 
standardized tests in a dmissions^ grad ing, and tracking are 
considered. These tes't scores are used in college admissions to 
predict college grades.^ Some kind of selective admissions process is 
needed^ but admissions procedures shoiald be designed to select 
students, and not to track them. Modifying admissions policies is the 
best way to remedy *:he unequal opporturi ty posed by tracking. 
Alternatives to the standardized t^est for admissions must be found if 
minorities are to gain equal access to higher education. 
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One of the most gratifying outcome, of th= controversy surrounding 
the recent Bakke case was that the higher educat n estab.l i shment lined 
up squarely beh i nd the Un i vers ity of California supporting minority- 
oriented affirmative action plans in collect adnis' -ons. Even before lUe 
decision came down, representatives of the majcr :ucational associations 
expressed their determination to continue a d even expand minority-admis- 
sions programs. 

While these public pronouncements ^ ^ u- rer.^;3ur'ng to those 
of us who.. are committed to expanding educction.. j^nort nitie: f minorities, 
it Is not clear just how such lofty intentior - to 

into concrete actions that will, in fact, siren ]t.h 
efforts. No matter how committed they may be to r-^- 
most educators are reluctant to recr-Oiize that : e 
practices still pose serious obstacle-^ to -■j'^^- ^ 
and the disadvantaged. Specif ical i_v, the ac 
admissions, grading, and ability tracking. Stand" c; 
noted, jplay a prominent part in each these ac ' < 
talk today is that our efforts to impl^nient aff^^ dti ^ 
not be very effective until we are willing to cc^s'j'e' 
traditional use of standardized tests ^'-^ admfs ^ns. r v* i, and tracking. 

Most of the controversy about standar :ed testii has emphasized 
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whai might be :al led the "construct valic .cy" of these inst- rrents. Are thpy 
"culturally biased"? Do they accurately reflect the acadL. c abilities r 
p-^rsons who are not from white, micidl e-c ?hs5 backgrounds? In my judcmer , 
these concerr nave been misplaced, since controversy ov^— what the te * - 
"really" mec ^ i has tended to obscure a nurh more f undamerr:':: ' problem: T 
tne way the r-^szs are used is basicallv nappropriate tc t ^e purposes 
ecjcation. is misuse of tests, in turr> poses special Irstacles to ne 
educat -^nal - H'-Iopment of disadvantaged rr nority groups. 

""e^O" msidering my specific obje-:-:ions tc. current • ses of tests 
ad-^iss'o^s. ^::ing, and tracking, it shou:a be made clear that my argurrr- 
3r-^ pre-.- : on one fundamental asssmption: namely, that the m.^ or 
puroo:- . iher educational institution: is t:^ educate students . In c:her 

wor'js, c '.iTiental mission of e^y cc^.e::e or university is to produce 
cer' -in ira - changes in the uder or, mere simply, to make a c'iffer- 
encf^ in :h * . Kent's life. Vhis ncept ion of the purpose of higher insti- 
tuti -^ur-- resembles the eco^«f' ist ^ nc::ion of "value added." 

Let' fi^ :t consider the u of - ests in admissions. Perhaps the 
best way tc up\ir:--stand the use of 'tf": i- admissions is to ask an admissions 
officer or friu ty member why teszs ^re :sed to decide which students are 
shouic be adm- 'ted or rejected. The "rost ■ ke]y response to this questi-^n is> 
"because the tests predict grades in colle;^/ Forcetting for the. moment that 
the college grade point average leaves sorrBt^ring to be desired as a measure of 
stucent learning, let us simply consider xrie educational implications of the 
prediction argument: Peopl e. with high scores on an admissions test are 
preferred over those with low scores on grounds that they will subse- 

quently earn higher grades in college. 'n my judgment this "preJiction 
arr]iiii)ent is a weak justification for the u nf ^any selection device, simply 



because preifction may have little, if iny^ning, to to wiith the educational 
or ".dlue added'' mission of the institution. 

"o illustrate the fallacy inherent in -e rredictic arc;ijment, suppose 
that -\ college was admitting all the w^onc stucents, S( that the students 
learn- :i ' absolutely nothing from their colle*- exoerience. Even in such an 
extrerr^ case, the admissions tests coul st 1 r--edict o^ades. In the same 
vein, f on- could administer college admissi^-s tests to h -gh school seniors, 
put t- »f? in a state of suspended animation f^r four years, then revive thefi: 
and g^ then a set of final examinatiorrs, thr college admissions tests would 
still r.i - validity" in D-edictinq performance on the final examinations. As 
a matte- fact, since different students are likely to learn at different 
rates, tands to reason that the qreatest student learning may be occurrinc 
at tho*^' r 'leges where test: hr 'e tne poorest predictive "validity." 

jL.^jible rejoinder - .his obj^^tion is that the criterion used to 
justify T ssions testing- college grade point average--is indeed a 

measL-e - .nat the student ^-s learned in college. While there has been 
very lit' --^^.earch on this ^.."damental problem, what -evidence there is fails 
to SLippo' . it. One study (H-T-ris, 1970) examined pretest-posttest gains on 
.the Colle:e L^el Examination rogram admini stered befcre and after exposure 
to various courses. As it tur-s out, students who got failing or near-failing 
gra-Tr-s showed ^est score ga^ns comparable to those of students with high 
grcJes. Is possible tha:: standardized testinci, when carried ,out on a 
pret < 5t-post' St basis, of ^ -^rs a much better means of assessing student 
progress than 'he grade poin average? Should our dependence on tests shift 
from its curre'* emphasis on selection and prediction to a greater emphasis on 
learning and development?. 

The main objectiW to traditional grading practices, of course, is 



that the college grade point average is a rel atlve measure that^simply 
ranks students from best to worst without any Necessary reference to what the 
student has learned as the result of the educat\ion/il program. Why institu- 
tions have not relied more on standardized tests to assess cognitive learning 
has always been a puzzle- Many professors argue that the tests are somehow 
inadequate: too . superficial, ^^-4^:3complete in coverage, culturally biased, and 
so forth. Professors' heavy reliance on^these same tests to decide under- 
graduate admissions, advance'd placement, and admission to graduate and 
professional schools belies these objections. If academics are perfectly 
willing to use standardized tests to evaluate the performance of prospective 
students, how come such tests suddenly become inappropriate measures of 
learning once the student is admitted? 

One difficulty in using standardied tests to monitor student develop- 
ment is the way they are scoreri^-^^^irtual ly every test-maker today reports 
student performance only ir normative terms: percentiles, standard scores, 
stanines, and so on. These norm-based scores show how the individual performs 
only in relation to others. Such scores are useful for selecting and screen- 
ing (i.e., identifying the "best" students), but they are difficult to use 
for measuring change or growth. Lacking any absolute referent, it is diffi- 
cult to know from successive administrations of such tests if, or how much, 
the person's performance level has improved. (The traditional letter grade 
system suffers, of course, from the same limitation, particularly if students 
are graded on the curve.) 

There is no good reason why such tests cannot be used to produce 
information on changes in student's absolute level of performance: for 
example, in the fiumber or percentage of items answered correctly. In addi- 
tion, performance data on individual test items could be provided to diagnose 



particular strengths and ^eak/uc^ses or to measure growth in more s .fic 
areas of skill or know ' Psychometr i c i ans have discouraged ,tinq 
companies from report in .It*^ for individual test items r *:he oi.incs 

that such information ts uin --•'^ V:hle. However,- individual ite data be 
highly reliable if, th& ^rirr f nc^:ed fcr groups of students.^ f ic - 
leges and uni ver^sitie: i lat us- staneardized tests demand the* r*e jlr be 
ri.'po^ted in r--.v scor' ; nrm rr xh^t item data be included, "ne tes 
companies will -r^obablv p- .1 .e inem. S-Jch demands could be macrr :r most 
the tests now 1 wide: ac u: : the achievement and cdvanc Tlac^e : 
tests of the Crllege F a amination Board (CEEB), the l-^ge L eVt-1 

Examination Program :■: ' . ~ ^ ndergraduate program of the Grac..*e R- i 
Examination, and th-^ annus de- ices to select students for -uate 1 J 
professional schools. ' 'V^vtcriig the additional information thL^ vtoiw 
available to teachers vid sstuc!^- ts, the cost of providing it wou'r: be t. - 
ial. Given that so r uh ^are i-.. devoted to writing and pretestiirr iters ?\ 
that these activities nt fcr a major share of the cost of ^t^rdardi 'oc 

test construction, V' -ia" benefits from these developm^-^L J eff:^'±s 

should not be lost ^ n«, :ruction of "scales" and the compi.tatl, 0^ 

percentiles, standard •. ^nar ether normative measures. 

Why do the te: s ^'ersist in limiting their feedbc :o itcttio- 

tive measures when ? information is of limited value to --nt: and 
institutions? There 1 : e at least two explanations. Firs^i-. ^ - ps ^cho- 

metricians wno contrc technical aspects of the test industry - e b-^come 

mesmerized by the st ' ic^.l properties of standard scores and oy ::ie ele- 
gance of the normal d ^ lucion which underlies classical test theorv. Thus, 

1. Data on individual t'^ item performance has been used, with c^nsio -^able 
success, in the Inte rnat ^al Study of Achievement in Mathematics , edit-l by 
T. Husen {t\e>t. York: Wile:, 1957). (^ock'hom, Almquist and Wiksell.^ 



raw scores (number ansv/ered correctly, rights ims wrongs) are converted to 

norrative scores, a procedure that eliminates ^ original units of rieasure- 
< 

r:^?nt. F-jti the purist's per'"^pective, individua- iten^ have even worse statis- 
tica pr-jperties, so they rust be lumped tog-etl^er in sufficient numbers :o 
r>rod:-:e " cales*' 'hat form the appropriately ' aped distribution wit* the 
crape- c-— -ee of "^tgI i abi 1 i ty . " 

-:^nd, : more important, is thai nor live measures ar' con^ ent 
with :he \ >/9 and meritocratic values the permeate so much A- -r-^-can 

society. In ' h\ > regard, it should be kept mind that the :i:illy 
admin^st:^-:. ^tr ltgence test, developed early in this century, -s a 

model fzr r^^ gr^up-administered tests used tocay. While the s re was 
not, 5tri«:-^ / speaking, a normative measure, it prompted the use ' r native 
jargon wii i a st*:^ong meritocratic connotation: cenius, superior, Jl ^al, 
imbecile, ind so forth. The earliest large-scale applications of -Qi;p jsting 
were in ne military, which, during the two w^^ Id wars, was c :err\'^^ with 
screening out illiterate and "mentally defective" recruits and --a-: — s and 
-identi^ ''ing candidates for officer training. These applications — e basically 
meri tor-atic: finding the "best" -and "worst" candidates. This lew of group 
tests ontinued after World War II when the cr:sh of applicants forced many 
colleg^e^ to institute screening procedures that cc :ld be applied on a large scale 
at rel " ively low cost. Normative scores pro.id-d a simple and seeniingly fair 
means identify the "best" students. This meritocratic view of testing has 
been r. -^forced by the competitveness of the colleges themselves, where the 
"best" cr-jileges are, of course, those with the highest scoring students. 

It :he 1950s and 1960s this meritocratic orientation was reinforced by 
competitiveness at the international level, when many Americans interpreted 
the first Soviet Sputnik to mean that the United States had slipped behind 



because m.r y of :s "b ightest'^ tudents v.^re not '^oing to college. One - 
manifestation o: . i? corcern was he 'Jation^vl Merit Scholarship Corporation, 
v/hich arnually c-ose ^o :9 rnillici students just to identify the 

1,500 or so. witT .> hfqhest sc^es who could be awarded scholarships to 
assure college at*: - -aance. Collec-75, of cour-se, became highly competitive in 
their quest for it S:nolars, d the number of Scholars in the student 
body was widely r--:^. ded 2s a sic^" academic quality. A sinilar competition 
jevelopec among z -^igh schools. 

Today, pos' us in acade^n- take the normative-meritocratic nature of 
■testing for gran: - . As a consec .rnce, concern with student change, ^^rowth, 
or development i: : bord^nated to -anking students from best to worst. This 
bias has infected grading practices, where instead of determining what stu- 
dents learn from the beginning tc the end of a course, or what they learn in 
college, grades are assignee which reflect primarily how students perform 
compared with each other at a g ven point in time. Employers and graduate 
schools further reinforce these evading practices, since the student's GPA is- 
a convenient means to identify tht- "best" students. ^ 

There is nc reason why c:^l leges should persist in these pr act ices 
other than habit and tradition. Using results from individual items and 
absolute or raw scores requires little additional effort, regardless of 
whether the test is a nationally standardized or individual classroom exami- 
nat ion . 

It is important to recognize that a meritocratic approach to selection 
of talent probably makes much better sense in other sectors of our society 
such as- the military, business and industry. What seems to Have happened 
is that college admissions officers now function more like personnel mana- 
gers in a commercial enterprise than like educators. Picking the "best" 



is in appropriate acri.ity for business and industry, since their goals are 
to lire the best tiilent to maximize productivity and profit. Similarly. 
cofTT-et: tion ariong rival companies for the pool of available talent is consi>- 
ten*- with the very nature of free enterprise economy. But the busines 
mode:, which fias been adopted by the most selective institutions, is not 
apprcT'^i ate to educarion. The college does not exist primarily to explolw 
taluiii. The mission - the college is not simply to maximize its output r--^ 
dis^ nuished alumni ry enrolling as many talented students as possible. Sucti 
a sTcLic process reduces the college to a kind of funnel: What comes out lis 
pur a matter of what goes in. Colleges and other educational i nsti tutiorr^s 
ex s: to develop talent^ to change the student, to contribute to personal 
d-- e'opment, to make a difference. 

If colleges and graduate and professional schools must limit tne 
n-mber of availab e spaces for applicants, then some kind of selection 
p*^ocess is obviously needed. The "value added" approach to education suggests 
t'.at ai:*]iissions procedures should be designed, at least in part, to select 
studenos who are likely to be influenced by the educational procels. The 
difference bet..?en the value added and the predictive approaches can be 
il iJ^str :ted w.L/f an analogy from the field of thoroughbred racing. Some 
years ago I put forward the argument that admissions officers in selective 
institutions function very much like handicappers at the Vace track: They 
try merely to pick winners. By looking. over the various candidates and 
evaluating their respective talents, they select those likely to perform 
well. But handicappers are interested only in p redicting the horse's perfor- 
mance, not in helping it to run better and faster. An educational institu- 
tion should function not like a handicapper, but like a jockey or trainer: 
It is responsible for improving the ^performance of the student, not merely 
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for identifying those v/ith the greatest potential. 

My argument so far involves two basic proposals for change: (a) 
first is that the meritocratic model of admissions be abandoned in favor of a 
value added approach and (b) that test scores, together with item and raw 
score data, be used instead to provide both s-tudents and teachers with feed- 
back concerning student growth and development. While this second proposal 
could probably be implemented tomorrow if the academic community were prepared 
to abandon its traditional grading system, the "value added" approach in the 
admissions process poses a number of unresolved problems. Most important is 
that there are probably no existing selection devices that adequately reflect 
the student*s potential to- benefit from the collegiate experience. Lacking 
such measures, it might be argued that we^ should stick to traditional admis- 
sions tests until something better is found. The problem with this argument 
is that the negative effects of tests in college admissions are not benign: 
they operate to the particular disadvantage of those minority groups who 
represent some of the most disadvantaged segments of our society. 

We recently conducted a simulation survey to deter mi ne just how 
•serious the handicap posed by college admissions tests really is. For 
example, if we are faced with a crush of ^applications where only one in ten 
of the applicants can tje admitted (such a situation is not unlike that found 
in many medical and law schools), relying solely on tests as our selection 
guide eliminates 99 percent ^f the black applicants and 97 percent of the 
Chicano applicants. In other words, a white student is three times more 
likely to be selected than a Chicano student, and ten times more likely 
than a black student. Clearly, some other approach to the admissions process 
should be found if these minority groups are ever to reach anything resem- 
bling educational parity with the white majority. 
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Let us now turn to the third educational practice that involves 
the use of standardized testing: tracking of students into different types of 
institutions. Public higher education in many states is arranged hierar- 
chically, with the major public university occupying the top rung, the 
state colleges (many formerly teachers colleges) the middle rung, and the 
bottom position being occupied by open-door community colleges. In some 
states, tests play a major role in determining which of these three options 
is available to the high school graduate. Generally speaking, the highest- 
scoring students can avail themselves of all three tiers, whereas the lowest- 
scoring students are allowed to enter only the community colleges. Not 
surprisingly, loiv income and miaority students tend to be underrepresented 
in the universities at the top ' of the hierarchy and heavily concentrated 
in the community co] leges . 

flow if these different types of institutions were roughly equivalent 
in their resources and offerings, one could argue that this type of ability 
tracking is not a denial of equal access. However, institutions at different 
positions in the hierarchy are by no means equivalent, so that the student's 
future may depend as much on the kind of institution attended as on attendance 
versus nonattendance. With the proliferation of public community colleges 
and. the substantial financial aid now available to needy students, the real 
issue of equality of access is not who goes to college, but who goes to 
college where . 

If one compares the educational resources of these three levels of 
institutions, in every respect they form a perfect hiierarchy with the univer- 
sities having the greatest resources and the community colleges the least. I 
am speaking here of such varied attributes as student-faculty ratios, faculty 
salaries, physical plant, libraries, and expenditures for general educational 
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purposes. Moreover, institutions at the top of the hierarchy are also much 
more likely to have another -asset: residential facilitiies. A considerable 
body of recent research (Astin-, 1977: Chickering, 197 ) shows that students, 
get more out of their undergraduate experience if they can live. on carnpus 
rather than commute from home. Finally, longitudinal studies show that a 
student's chance of persisting is substantially better at a -four-year college 
or university than at a community college. In short, these results suggest 
that the use of tests to track students into different types of institutions 
substantially reduces educational opport-unit ies for those students who tend 
to receive low scores. 

Is there any educat ional justification for the tracking-arrange- 
ment that" exists in most states? Perhaps the most common ^rationale offered 
for trac;<ing is that students will develop better academically if they are 
grouped with students of similar ability. In effect, this arqument assum.es 
a kind of aptitude-treatment interaction effect which h:i . ' important 

corrolary assumptions. First, high scoring students are : ■ j-- 'o.need the 
stimulation and competition of each other to realize their potential. 
Second, it is assumed that high scoring students will become bored and 
apathetic if grouped, with loyver-scoring students. And finally, it is argued 
that the lower-scoring student will become intimidated and discouraged if 
forced to compete with high scoring students. Considering the fundamental 
importance of such interaction assumptions, it is remarkable that so little 
research h*as been done to test them. What evidence there is, however, offers 
virtually no support. Thus, as far as learning outcomes are concerned, there 
seems to be little^^pr no interaction between the selectivity of the institu- 
tion and the ability of the student (Astin, 1968: liichols., 1954: Rock, Centra, 
and Linn, 1970). 
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My hunch is that institutional tracking is perpetuated less for 
educational reasons than. for reasons of meritocratic competition and staius. 
Professors in the universities at the top of the hierarchy support selective 
admissions because students with high test scores are easier to identify with 
and easier to teach. Indeed, even within a given classroom, professors 
probably favor thei^ most advanced . students. Selective admissions is sup- 
ported by alumni, legislators, faculty, administrators, and probably a 
great many students because a good input of highly motivated and talented 
students will al-most guarantee a good reputation and a good output of distin- 
guished and possibly wealthy alumni. The secondary schools support the track 
system that results from selective admissions because they see it as a reward 
or. incentive system to motivate their students: Teachers and guidance 
counselors frequently tell their students to study hard so they can get into a 
"good" institution'., ' . • • . 

Selettive admissions is also justified in these institutions on 
the grounds that any relaxation of admissions standards at the selective 
institutions would lower their academic standards. While such an outcome is 
indeed possible, it is by no means inevitable'. Academic standards have to do 
with the performance levels required before the institution wi 1 1 certify that 
the student has passed certain courses or completed certain requirements for 
the degree. There is no necessary reason why a modification in admissions 
standards has to. be accompanied by a modification, in certification standards. 

Perhaps the most important reason for hierarchical public systems is 
economic. In the minds; of many legislators and planners, the two-year 
college is-^an appealing way to expand access becai^se it is much less expensive 
than, other institutional forms. ^ At th^ same time, expanding the community 
college . al lows the more selective universities., to preserve their selectivity 
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and prestige and to avoid the pedagogical difficulties associated with 
teach ing^l-ess^wel 1 prepared students. Viewed from the perspective of the 
minorities, howeveV, these hierarchical public systems based on selective 
admissions represent a denial of equal opportunity. ^ ^ 

There would seem 'to be at least two ways to approach the problem of 
how. to remedy the unequal opportunity posed by hierarchical public systems. 
The first, and certainly the most unrealistic approach, would be to equalize 
resources and expenditures across the different types of institutions. If 
this means taking resources away from the universities to upgrade the com- 
munity 'col leges, the universities would never put up with it. If it means 
adding resources to the total higher education budget., within the state, the 
legislators and taxpayers would probably never support it. The second 
approach--which involves modifying admissions policies--s9ems like a much 
simpler and certainly less expensive solution. Nevertheless, if the univerr 
sities change their policies to admit larger numbers of underprepared stu- 
dents, additional resources will almost, certainly be required 'to provide 
adequate remedial assistance and other special support services. 

In conclus^ton, I suppose I find myself on tv/o sides of the testing, 
issue: advocating less dependence on tests for one purpose and more use of 
tests for another. Specifically, I think the time has 'corr.e to deemphasize 
the use of standardized tests in -screening and selection and to reconsider 
their potential for evaluating student progress and assessing the effectiye- 
ness of educational program^' at all levels. In research jargon, this means 
thai tests should.be used less as independent variables and more as dependent 
variables. At the same time, we should begin to consider retrieving some of 
the information we lose when we aggregate test items into scales and convert 
raw scores into normative scores. My impression is that the resistence to 
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standardized testing will decline substantially if people can come to repard 
these devices as sources of feedback to enhance the teaching-learning process, 
rather than as screening devices that limit educational opportunity. 

But what about the admissions process? -If the use of tests is to be 
minimized in the interest of expanding opportunities for minorities, and if 
the predictive model' is to be abandoned in favor of a "value added" model, how 
are institutions that have more applicants than available spaces supposed to 
make their selection decisions? While I have no magical solutions for this 
problem, -selective institutions in the public sector might want to consider 
experimenting wi th. alternative selection procedures identifying students with 
the greatest potential for growth or learning . The ideal sdlution, of 

cQijrse, would be to reshape our public systems to the point where there is a 
much closer correspondence ^between student demand and the supply of available 
places in various types of institutions. Such* a system, of course, would 
obviate the need for selective admissions. But until such a Utopian balance 
is achieved, the standardized'te's.t and the predictive model of admissions will 
continue to represent serious barriers to the educational development of 
minority students. If we are real ly- serious in our. i ntentions to expand access 
for minorities, alternatives to the standardized test must be found. \ 



